Search results for "Information criteria"

showing 8 items of 8 documents

Model selection procedure for mixture hidden Markov models

2021

This paper proposes a model selection procedure to identify the number of clusters and hidden states in discrete Mixture Hidden Markov models (MHMMs). The model selection is based on a step-wise approach that uses, as score, information criteria and an entropy criterion. By means of a simulation study, we show that our procedure performs better than classical model selection methods in identifying the correct number of clusters and hidden states or an approximation of them

model selectionclustersinformation criteriaSettore SECS-S/01 - Statisticahidden statesentropy-based scores

researchProduct

Model selection for penalized Gaussian Graphical Models

2013

High-dimensional data refers to the case in which the number of parameters is of one or more order greater than the sample size. Penalized Gaussian graphical models can be used to estimate the conditional independence graph in high-dimensional setting. In this setting, the crucial issue is to select the tuning parameter which regulates the sparsity of the graph. In this paper, we focus on estimating the "best" tuning parameter. We propose to select this tuning parameter by minimizing an information criterion based on the generalized information criterion and to use a stability selection approach in order to obtain a more stable graph. The performance of our method is compared with the state…

Gaussian Graphical ModelInformation Criteria Stability SelectionPenalized likelihoodSettore SECS-S/01 - Statistica

researchProduct

Testing and implementing a new approach to estimating interregional output multipliers using input-output data for South Korean regions

2020

Flegg's location quotient (FLQ) is a useful tool for estimating intraregional output multipliers. This paper uses it as one component when estimating interregional multipliers. Using statistical information criteria and official data for 16 South Korean regions, it is found that the best approach is to combine the FLQ with a simple trade model. The paper explains how the proposed procedure can be implemented for both multiple and individual regions, and also how a region-specific value for the unknown parameter δ in the FLQ formula can be determined. Finally, an illustrative case study of one of the regions is carried out.

sijaintiMathematical optimizationalueelliset erotComputer scienceGeography Planning and Development0211 other engineering and technologiestalousmaantiedealuetutkimusInformation Criteria02 engineering and technologyEtelä-KoreaFlegg's location quotient (FLQ)Component (UML)0502 economics and businessinterregional multipliersEarth and Planetary Sciences (miscellaneous)Economic base analysisHardware_ARITHMETICANDLOGICSTRUCTURES050207 economicspanos-tuotosanalyysiinformation criteriaInput/outputgravity model05 social sciencestuloksellisuus021107 urban & regional planninganalyysimenetelmätGravity model of tradeStatistics Probability and UncertaintyGeneral Economics Econometrics and Finance

researchProduct

A computationally fast alternative to cross-validation in penalized Gaussian graphical models

2015

We study the problem of selection of regularization parameter in penalized Gaussian graphical models. When the goal is to obtain the model with good predicting power, cross validation is the gold standard. We present a new estimator of Kullback-Leibler loss in Gaussian Graphical model which provides a computationally fast alternative to cross-validation. The estimator is obtained by approximating leave-one-out-cross validation. Our approach is demonstrated on simulated data sets for various types of graphs. The proposed formula exhibits superior performance, especially in the typical small sample size scenario, compared to other available alternatives to cross validation, such as Akaike's i…

Statistics and ProbabilityFOS: Computer and information sciencesGaussianInformation CriteriaCross-validationMethodology (stat.ME)symbols.namesakeBayesian information criterionStatisticsPenalized estimationGeneralized approximate cross-validationGraphical modelSDG 7 - Affordable and Clean EnergyStatistics - MethodologyMathematics/dk/atira/pure/sustainabledevelopmentgoals/affordable_and_clean_energyKullback-Leibler loApplied MathematicsEstimatorCross-validationGaussian graphical modelSample size determinationModeling and SimulationsymbolsInformation criteriaStatistics Probability and UncertaintyAkaike information criterionSettore SECS-S/01 - StatisticaAlgorithm

researchProduct

Estimating the Number of Changepoints in Segmented Regression Models: Comparative Study and Application

2020

This paper deals with the problem of selecting the number of changepoints in segmented regression models. The aim is to review selection criteria, namely information criteria and hypothesis testing, and to propose a novel application in the context of students' careers in higher education. The performance of the selection criteria is assessed through simulation studies. Furthermore, we investigate the relationship between University students' performance and one of its main determinants, finding out that this relationship is actually broken-line.

Higher educationComputer sciencebusiness.industryContext (language use)Information CriteriaMachine learningcomputer.software_genreHypothesis testingSegmented regressionChangepointInformation criteriaHigher educationArtificial intelligenceSegmented regressionSettore SECS-S/01 - StatisticabusinesscomputerSelection (genetic algorithm)Statistical hypothesis testingSSRN Electronic Journal

researchProduct

Selecting the tuning parameter in penalized Gaussian graphical models

2019

Penalized inference of Gaussian graphical models is a way to assess the conditional independence structure in multivariate problems. In this setting, the conditional independence structure, corresponding to a graph, is related to the choice of the tuning parameter, which determines the model complexity or degrees of freedom. There has been little research on the degrees of freedom for penalized Gaussian graphical models. In this paper, we propose an estimator of the degrees of freedom in $$\ell _1$$ -penalized Gaussian graphical models. Specifically, we derive an estimator inspired by the generalized information criterion and propose to use this estimator as the bias term for two informatio…

Statistics and ProbabilityStatistics::TheoryKullback–Leibler divergenceKullback-Leibler divergenceComputer scienceGaussianInformation Criteria010103 numerical & computational mathematicsModel complexityModel selection01 natural sciencesTheoretical Computer Science010104 statistics & probabilitysymbols.namesakeStatistics::Machine LearningGeneralized information criterionEntropy (information theory)Statistics::MethodologyGraphical model0101 mathematicsPenalized Likelihood Kullback-Leibler Divergence Model Complexity Model Selection Generalized Information Criterion.Model selectionEstimatorStatistics::ComputationComputational Theory and MathematicsConditional independencesymbolsPenalized likelihoodStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaAlgorithmStatistics and Computing

researchProduct

Statistical relationship between hardness of drinking water and cerebrovascular mortality in Valencia: a comparison of spatiotemporal models

2003

The statistical detection of environmental risk factors in public health studies is usually difficult due to the weakness of their effects and their confounding with other covariates. Small area geographical data bring the opportunity of observing health response in a wide variety of exposure values. Temporal sequences of these geographical datasets are crucial to gaining statistical power in detecting factors. The spatiotemporal models required to perform the statistical analysis have to allow for spatial and temporal correlations, which are more easily modelled via hierarchical structures of hidden random factors. These models have produced important research activity during the last deca…

Statistics and ProbabilityOperations researchComputer scienceEcological ModelingBayesian probabilityBayes factorMarkov chain Monte CarloDeviance (statistics)Information CriteriaStatistical powerDeviance information criterionsymbols.namesakeCovariateStatisticssymbolsEnvironmetrics

researchProduct

Model selection using limiting distributions of second-order blind source separation algorithms

2015

Signals, recorded over time, are often observed as mixtures of multiple source signals. To extract relevant information from such measurements one needs to determine the mixing coefficients. In case of weakly stationary time series with uncorrelated source signals, this separation can be achieved by jointly diagonalizing sample autocovariances at different lags, and several algorithms address this task. Often the mixing estimates contain close-to-zero entries and one wants to decide whether the corresponding source signals have a relevant impact on the observations or not. To address this question of model selection we consider the recently published second-order blind identification proced…

ta112Series (mathematics)Estimation theoryModel selectionasymptotic normalitypattern identificationAsymptotic distributionInformation Criteriaoint diagonalization SOBI AsympBlind signal separationMatrix (mathematics)Control and Systems EngineeringSOBISignal Processingjoint diagonalizationComputer Vision and Pattern RecognitionElectrical and Electronic EngineeringAlgorithmSoftwareMixing (physics)MathematicsSignal Processing

researchProduct